Accelerating XPath Evaluation against XML Streams

Author

  • Dan Olteanu
Abstract

Data streams are an emerging technology for data dissemination in cases where the data throughput or size makes it infeasible to rely on the conventional approach of storing the data before processing it. Areas where data streams are applied include monitoring of scientific data (astronomy, meteorology), control data (traffic, logistics, networks), and financial data (bank transactions). Querying XML data streams without storing them and without considerably reducing the data throughput is especially challenging because XML streams convey tree-structured data of (possibly) unbounded size and depth. This paper presents a non-trivial, yet natural and simple, extension of the SPEX query processor for XML streams [1]. This extension, called structural filters, exploits the structural relationships between the XML fragments encountered along the stream at various processing states in order to skip large stream fragments that are irrelevant to the query answer. Experimental evidence clearly shows that SPEX with structural filters is more efficient than plain SPEX. The structural filters are demonstrated together with SPEX using SPEX Viewer [1], a sophisticated visualization of the SPEX run-time system. More precisely, SPEX Viewer visualizes, step by step, query rewriting, query plan generation, incremental stream processing, and answer generation.

Related resources

Towards a Streamed XPath Evaluation

XPath is a language for addressing fragments of XML documents, used in query and transformation languages such as XQuery and XSLT. For many applications it is desirable to process XPath on the fly and progressively against data streams. This diploma thesis is devoted to streamed and progressive evaluation of XPath. A streamed and progressive XPath evaluation considerably reduces the needed memo...

An Evaluation of Regular Path Expressions with Qualifiers against XML Streams

This paper presents SPEX, a streamed and progressive evaluation of regular path expressions with XPath-like qualifiers against XML streams. SPEX proceeds as follows. An expression is translated in linear time into a network of transducers, most of them having 1-DPDT equivalents. Every stream message is then processed once by the entire network and result fragments are output on the fly. In most...

Evaluation of XPath queries against XML streams

XML is nowadays the de facto standard for electronic data interchange on the Web. Available XML data ranges from small Web pages to ever-growing repositories of, e.g., biological and astronomical data, and even to rapidly changing and possibly unbounded streams, as used in Web data integration and publish-subscribe systems. Animated by the ubiquity of XML data, the basic task of XML querying is...

FluXQuery: An Optimizing XQuery Processor for Streaming XML Data

XML has established itself as the ubiquitous format for data exchange on the Internet. An imminent development is that of streams of XML data being exchanged and queried. Data management scenarios where XQuery [11] is evaluated on XML streams are becoming increasingly important and realistic, e.g. in e-commerce settings. Naturally, query engines employed for stream processing are main-memory-ba...

Efficient Evaluation of Forward XPath Axes over XML Streams

Although many algorithms have been proposed for evaluating XPath queries containing unordered axes (child, descendant, parent and ancestor) against streaming XML data, there are very few efforts towards developing algorithms for processing path expressions with ordered axes (following, following-sibling, preceding and preceding-sibling). In this paper, we show how order information can be built...

Publication date: 2006